Metadata: [GEO (NCBI) - GSE271059]
IDH_status
TLS_status
Localisation
Primary_or_Recurrent
Age
Gender
Raw data: gene expression count matrix (genes × samples)
Loading data
Cleaning data
Replacing “Not_Available” with NA
Removing duplicates: distinct()
Standardizing names
Creating new metadata columns
Age_groups
TLS_status_bin
Normalizing gene expression
cpm_normalization
quantile_normalization
Computing GEP scores
Combining normalized counts with metadata
Preparing data for PCA/downstream analysis
Baseline tables
PCA on gene expression

TLS status does not strongly separate the samples
TLS group contains highly distinct outliers

Clear and distinct separation of samples by IDH status
Outlier subsets exist


